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ABSTRACT 

We investigate the promise of the Ly-a forest for high precision cosmology in the era of 
the Sloan Digital Sky Survey using low order N-point statistics. We show that with the 
existing data one can determine the amplitude, slope and curvature of the slope of the 
matter power spectrum with a few percent precision. Higher order statistics such as 
the bispectrum provide independent information that can confirm and improve upon 
the statistical precision from the power spectrum alone. The achievable precision is 
comparable to that from the cosmic microwave background with upcoming satellites, 
and complements it by measuring the power spectrum amplitude and shape at smaller 
scales. Since the data cover the redshift range 2 < z < 4, one can also extract the 
evolution of the growth factor and Hubble parameter over this range, and provide 
useful constraints on the presence of dark energy at z > 2. 



1 INTRODUCTION 



The study of the Ly-a forest has been revolutionized in 
recent years by high resolu tion measuremen ts using the 
Keck HIRES spectrograph dVogt et alJ ll994Tl and by the 
development of theor etical understanding using hydrody- 
namical simulations dCen et al.1 Il994t IZhang et al.1 Il995t 
iHernaui st et al.| |l996j: j Theuns et alJ I1998T) and analytical 
models llGnedin fc Hull99a) . The picture that has emerged 
from these studies is one in which the neutral gas responsi- 
ble for the absorption is in a relatively low density, smooth 
environment, which implies a simple connection between the 
gas and the underlying dark matter. The neutral fraction of 
the gas is determined by the interplay between the recombi- 
nation rate (which depends on the temperature of the gas) 
and ionization caused by ultraviolet photons. Photoioniza- 
tion heating and expansion cooling cause the gas density and 
temperature to be tightly related, except where mild shocks 
heat up the gas. This leads to a tight relation between the 
absorption and the gas density. Finally, the gas density is 
closely related to the dark matter density on large scales, 
while on small scales the effects of thermal broadening and 
Jeans smoothing have to be included. In the simplest pic- 
ture described here all of the physics ingredients are known 
and can be modeled. 

Within the model described above, the relation between 
the observed flux and the underlying dark matter is com- 
pletely defined, enabling one to study the dark matter cor- 
relations with the help of t he Ly man-Q forest. This was pro- 
posed first bv lCroft et all il99sh and has been subsequently 
investigated and applied t o the real data by several groups 
JCroft et alJll998l l2002bt iMcDonald etHllbood : iHui et alJ 
l200lT) ~This work established the simple flux power spectrum 
as the statistic of choice, although other statistics such as 



the flux probability dis tribution have also been investigated 
jMcDonald et alJl2000h . 

This paper addresses in more detail the question of how 
much information about cosmological parameters can be ex- 
tracted from the analysis of the Lyman-Q forest and what 
are the best statistics to use. We focus on N-point statis- 
tics (where N=2, 3, 4), since these provide the simplest 
parametrization of the long-range correlations. We use a 
Fisher matrix analysis to assess the uncertainties in cosmo- 
logical parameters for a given uncertainty in Lya forest mea- 
surements. To calculate the Fisher matrix, we must calculate 
the derivatives of the observables (flux power spectrum and 
bispectrum) with respect to cosmological parameters, cen- 
tered on a reasonable fiducial model. Using this approach, 
we show that higher order statistics add to and indepen- 
dently confirm the information gained from the flux power 
spectrum, and we address practical issues of such a study. 
For the data sample, we assume a sample with noise and 
resolution characteristics of the Sloan Digital Sky Survey, 
which currently contains a few thousand QSO spectra with 
measured Lyman-a forest [z > 2.2). 

Statistical correlations in the Lyman-Q forest are sensi- 
tive to several parameters of cosmological interest. Broadly, 
they are sensitive to the linear power spectrum of matter 
fluctuations on scales around 1 comoving h~ 1 Mpc. In this 
paper, this power spectrum Pl(/c, a) is expressed as 

P L {k,a) = AD 2 (a)fc n+(a/2)ln(fc/fcp) T 2 (fc). (1) 

The parameters that are being studied are the power spec- 
trum amplitude (« = lnV^L), the primordial slope n, and 
primordial curvature a. We also investigate the sensitivity 
to the linear growth factor D(a), where a = (1 + z) _1 is the 
expansion factor. 

Several additional parameters affecting absorption in 
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the Lyman-a forest must be included, such as those of 
the gas temperature-density relation, which for our redshift 
range is assumed to have the form 

T ^ = T -(l^) 7 " 1 - « 

T1.4 is used because ^McDonald et, alJ (j200ltl determined the 
temperature at this density most precisely. T1.4 and 7 are 
two of the parameters studied in this paper. In addition, the 
mean transmitted flux F(z) is also assumed to be a free pa- 
rameter (which can be related to the UV background and 
baryon density though the equation of ionizing equilibrium). 
Thus six parameters were varied at each redshift. Our ap- 
proach is to use hydrodynamical and N-body simulations, 
and vary all the parameters at each redshift. We are inter- 
ested in the sensitivity to cosmological parameters, so we 
marginalize over all the other parameters when presenting 
the results. 

The outline of the paper is the following. In section|2]we 
describe our HPM simulations and the hydrodynamic simu- 
lations that are used for comparison, introduce the statistics 
used in the course of the analysis, and discuss our choice of 
simulations, filters and instrumental effects for this study. 
Section 13 describes the procedure of calculating the Fisher 
matrix, and the determination of errorbars for the param- 
eters of interest. In section [I] we address the possibility of 
using the Lyman-a forest to determine quintessence den- 
sity Q q or equation of state w q , and more general deviations 
from the Einstein-de Sitter growth factor D(a) = a. After 
discussing the implications of the results in section |S] we 
conclude with a discussion of future work in this area. 



2 SIMULATIONS 

To assess the sensitivity of the Ly-a forest to cosmo- 
logical parameters we perform ed a Fisher matrix analy- 
sis (e.g. iTeemark et al.l il99^ '). We conducted a redshift- 
dependent analysis using 9 redshift bins, centered at z = 
2.2, 2.4, . . . , 3.8 and using a realistic redshift distribution of 
quasars as found in SDSS data. Since we only had simula- 
tions with values of a spaced by 0.04, we chose the closest a 
available. 

W e use Hydro-P M (HPM ) s imulations JGnedin fc Huil 
[ 19981: IMcDonald! l200ll : IMcDonald et al l l2002t 
ICen fc McDonald! 2002) rather than more accurate 
hydrodynamic simulations due to the very large number of 
simulations needed. The boxes used for the main analysis 
had length 80 /i _1 Mpc, with N = 512 3 particles, and 
spectra were averaged over all three axes. Pixelization, 
smoothing, and noise of the data were imitated in the 
simulations, with S/N = 5 and Gaussian smoothing a of 
length 0.625 comoving h~ 1 Mpc (same size as the pixels), 
as will be described in more detail in section 12.71 64 2 lines 
of sight into each face of the box were used to reduce 
computation time after finding that using 128 2 did not 
yield more information. 

The fiducial model used for the full Fisher analysis is 
7 - 1 = 0.3, T1.4 = 17000 K, n = 0.75, a = 0.0, Sl m ,oh = 
0.26, cr 8 = 0.75 and F{z) from 



a fit to the data in IMcDonald et alJ i200(t) . Note that n 
and a here are the primordial slope and curvature. The val- 
ues measured are n e g and a e g , which include a contribution 
from the transfer function. The value chosen for 7 — 1 is 
the center of the range of theoretical values, and two-sided 
derivatives were done in steps of ±0.03 (in other words, us- 
ing simulations with 7— 1 = 0.0 and 0.6). Likewise, the value 
chosen for T1.4 is observationally justifiable, and derivatives 
were taken in steps of ±5000 K. Derivatives in n about 0.75 
and a about 0.0 (a value consistent with observation as well) 
were both done in steps of ±0.05. Derivatives in F(z) were in 
steps of ±0.02. Instead of varying the actual power spectrum 
amplitude, which requires a large number of different simu- 
lations, we varied a. In relating derivatives such as AP/da to 
the desire d derivative s dP/d ft, we used the approximation 
(shown in iMcDonakH (l200lT) to be quite accurate) that z- 
evolution of the power spectrum can be treated as a rescaling 
of k and T1.4. The change in growth factor can be treated 
primarily as a change in k, along with a small change in 
T1.4 (fixed in velocity coordinates, it consequently differs in 
simulation coordinates). Two-sided derivatives with respect 
to a were calculated using steps of ±0.04. Then, by sub- 
tracting off this temperature term from the a-derivatives, 
we obtain the portion of the a-derivative that mimics a 
change in power spectrum amplitude evolution. Then we 
use dn = da /a to form the amplitude derivative (this form 
assumes an Einstein-de Sitter universe with D(a) — a). 

While the bulk of our simulations are HPM, we 
use the hydrodynamic simulations for comparison when - 
ever possible. These a r e described in ICen et"al] i200ll) ; 
IMcDonald et alJ J2002ft: ICen fc McDonald! fcOoW and for 
more detail. ICei^^taJ 1 J2002h . They have box size 25 
7i _1 Mpc, with 768 3 Eulerian cells for baryons and 384 3 
dark matter particles, and the cosmology is ACDM, with 
tt A = 0.7, fi m = 0.3, tt b = 0.035, a 8 = 0.9, h = 0.67, and 
n = 1. Median temperature at the mean density is around 
15,000 K, slightly decreasing with redshif t for 2 < z < 4, 
consis tent with observational constraints in lMcDonald et all 
(2000). The three outputs used have redshifts 1.9, 2.45, and 
3.0. The HPM simulations used for the comparison with hy- 
drodynamic simulations had a smaller box size than those 
used for the full analysis. The parameters were box size 
25 h~ 1 Mpc, 384 3 particles, the same cosmology as the hy- 
drodynamic simulations, and 7 — 1=0.6 (the temperature- 
density relation in the hydrodynamic simulations is not pre- 
cisely a power law, but the slope is near 0.6). The two had 
nearly identical initial conditions, minimizing the sampling 
variance. The missing mass that was concentrated into stars 
in the hydrodynamic simulations was neglected in the HPM 
simulations. In all cases, F — 0.7 was used for the com- 
parison, with 2-sided derivatives using increments of ±0.02. 
The simulations were binned so that the hydrodynamic and 
HPM simulations had the same size pixels. 

We now introduce the statistics used for analysis, and 
then use them to address several questions about these sim- 
ulations: accuracy in comparison with hydrodynamic simu- 
lations, sufficiency of the box size and resolution, the mag- 
nitude of random fluctuations due to finite box size, and the 
effects of chunking on analysis. We discuss all but the last 
of these issues only to determine the sufficiency of our simu- 
lations for a Fisher matrix study, rather than requiring the 
higher precision necessary for their use in a data analysis. 
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The final issue, chunking, is discussed since it may be done 
in a data analysis, so it is necessary to show that it does not 
affect the results. 



2.1 Statistics 

In general, the statistics were computed using the flux resid- 
ual Sf = F/F — l rather than F itself. Besides the flux power 
spectrum P with normalization 



k [km s -1 ] -1 
0.001 0.01 



k [km s-'l- 1 
0.001 L ' 0.01 



§P(k) = (5%), 



(4) 



several higher order statistics were studied. Rather than in- 
vestigate the full bispectrum and trispectrum information, 
we decided to use a more restricted form of these general 
statistics. In this sense our results will be conservative. The 
higher order statistics are created using the following proce- 
dure: 8f{\) is Fourier transformed to find 8p{k). Next, it is 
band-pass filtered to create the field 



8 H (k) = 8 F (k)W(k) 



(•») 



where W(k) is some filtering function. Two types of filters 
used in this analysis are a square window, 



W s (k) 



0, if k < ki or k > k-2 

1, if fei < k < k 2 



(6) 



and a Gaussian window characterized by some k and at. 

Next, the filtered field &(&) is inverse transformed to 
create the real-space counterpart, 8h{X). The field H(X) is 
created by squaring: 



H(X) = (S H (\)) 2 



(7) 



We define T as the power spectrum of H . Another statistic, 
B, is the cross-spectrum between 5f and H . B and T are 
related to the three- and four-point functions, respectively. 
However, because T is not the reduced four-point function, 
there is a significant covariance between it and P. 
In addition, the cross-correlation coefficient 



R = 



B 



(8) 



was studied by IZaldarriaea et all (l2001bT) due to its sim- 
ple physical interpretation. As shown there, it is expected 
that R computed from the flux residual should be signifi- 
cantly negative for low k, since perturbations in high den- 
sity (low flux) regions tend to grow faster. As shown in this 
paper, one utility of these higher order statistics is that they 
break degeneracies between the power spectrum amplitude 
and F as measur e d by P alone. Furthermore, as shown in 
IZaldarriaea et alJ fcOOlbl) . they may also be useful in dis- 
criminating between gravitational processes (which give neg- 
ative B and R) and continuum fluctuations (which give posi- 
tive B and R) or other extraneous effects such as metal lines, 
star-formation induced outflows of gas, inhomogeneity in the 
UV background, and the limitations of simulations. 

Fig. shows P, B, T, and R for the central point in 
parameter space around which variation occurs for z = 3.0 
(7-_l = 0.3, T1.4 = 17000 K, a = 0.24, n = 0.75, a = 
0.0, F — 0.68). Note that we vary a rather than the power 
spectrum amplitude itself (which involves generating many 
more simulations). By Eq. we have 5k — Sa/a for an 





0.03 



Figure 1. The values of the statistics for the fiducial model noted 
in the text. Errorbars are those expected for the amount of data 
in this redshift bin. The filter is Bl (see Table 0, a square filter 
from 0.2-1.1 /iMpc -1 . 



Einstein-de Sitter cosmology (D(a) = a). Errorbars shown 
are computed using the variances of the statistics for a single 
simulation and the length of spectrum for this redshift in the 
SDSS data available to us. 

Our study showed that T and R do not add much in- 
formation, since T is so closely related to P, and R is just 
a combination of the other statistics. Consequently, in the 
rest of this paper, only P and B will be used. Figs. |2j and 
[3] show the relative variation of these statistics with all 6 
parameters. Note that in this section, all plots of the val- 
ues of statistics or their derivatives were created using the 
same filter, a square filter from 0.2-1.1 /iMpc -1 (unless noted 
otherwise), and derivative plots have the same scale for easy 
comparison. 

We briefly explain the effects of changing the various 
parameters on P and B that are illustrated in Figs. [5] and [3] 
Raising the overall power spectrum amplitude causes an in- 
crease in P and B on all scales shown here, because the 
increase in power in the matter power spectrum carries di- 
rectly over to the flux power spectrum. Increasing n lowers 
P and B on large scales and raises it on small scales, directly 
reflecting its effects on the matter power spectrum (for even 
smaller scales than those shown here, increasing the power 
spectrum amplitude and n actually causes a decrease in P 
because non-linea r peculiar velocities suppress power, as in 
iMcDonaldl d200lli i. The statistics are nearly insensitive to 
a because it does not have much effect on these scales. P 
and B show only a very slight increase with T1.4. The de- 
crea se of these s t atistic s with an increase in 7 — 1, consistent 
with|McDonald (2001), can be explained by considering the 
optical depth r oc 7 , so that the exponent on pt 
is decreasing, and hence the fluctuations in optical depth 
for a given density fluctuation also decrease. The decrease 
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Figure 2. Plot of the relative variation of the statistics with n, 
a, and re for one square filter. 



Figure 4. Relative difference in P and B for hydrodynamic vs. 
HPM simulations, for three z values. 
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Figure 3. Plot of the relative variation of the statistics with F, 
7 — 1, and T1.4. 



sequently the flux power spectrum (hence the degeneracy 
between power spectrum amplitude and mean flux). 



2.2 Comparison of HPM and Hydro simulations 

Fig. 0] shows the relative difference between HPM and hy- 
drodynamic simulations at three redshifts, Fig.|S]shows their 
flux derivatives at z = 2.45, and Fig. [(J shows their ampli- 
tude derivatives at z = 2.45. 

For the derivatives, the relative change (e.g., SP/P) was 
used for comparison, because differences in the magnitude of 
the derivatives could be simply artifacts of the differences in 
the magnitudes of the statistics themselves. Since this kind 
of change will cancel out of a Fisher matrix calculation (see 
equation II IP . the relative change is plotted to emphasize 
changes in the fc-dependent shape of the derivatives rather 
than their amplitude. 

As illustrated by Figs. 14161 the HPM simulations yield 
statistics and derivatives fairly similar to those of the hy- 
drodynamic simulations. The worst discrepancies in Fig. 0] 
occur for z = 1.9, already below the range of this study. 
A Fisher matrix analysis using just amplitude and F was 
completed for these simulations, and found that the error 
bounds for both parameters were 6% lower when calculated 
with the HPM than with the hydro simulations, a tolerable 
difference for our purposes. 



in power with an i ncrease in t he me an flux is consistent 
with the results in iMcDonaldl i200lf) . and can be under- 
stood by the proportionality between the optical depth and 
^2-0.7(7-1) g ecause t ne increased mean flux is equivalent to 
a lower optical depth, we can see that this should have the 
same effect as lowering the density fluctuations, and con- 



2.3 Resolution and box size 

Another question about the HPM simulations is the box size 
and resolution necessary for a realistic study of these statis- 
tics. To check whether the 80 /i _1 Mpc box with 512 3 parti- 
cles is sufficient, we computed the statistics and their deriva- 
tives for a 40 fe _1 Mpc box with 512 3 particles (higher reso- 
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Figure 5. Relative difference in two-sided F-derivatives hydro- 
dynamic vs. HPM simulations, at z = 2.45, with SF = 0.02. 



Figure 7. Comparison of statistics for different box size and reso- 
lution, with filters 0.2-1.1 /i _1 Mpc (bottom) and 1.1-2.0 h _1 Mpc 
(top) as shown on plot. 
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Figure 6. Comparison of amplitude derivatives for hydrody- 
namic vs. HPM simulations, with F = 0.7 and 6k = 0.167. 



lution), and a 40 fo _1 Mpc box with 256 3 particles (same res- 
olution, different box size). The values of the statistics and 
their derivatives with respect to F and k, with 7 — 1 = 0.3, 
T1.4 = 17000 K, a = 0.24, n = 0.95, a = 0.0, and F = 0.7, 
with typical instrumental effects, are in Figs. Q |H1 and 
respectively. The filters are noted on the plot. As shown 
in Fig. |7| box size and resolution differences are generally 



within tolerance for P and B with the higher filter shown. 
Any small box size effects may be attributed to the differ- 
ence in random seed in the 80 /i _1 Mpc boxes versus the 
40 ft _1 Mpc boxes. There is a noticeable difference for B, 
the lower filter, between the higher and lower resolution 40 
/i _1 Mpc boxes. This difference cannot be attributed to a 
seed difference since the seed is same for these boxes. The 
difference is not evident between the 40 and 80 /i -1 Mpc 
512 3 boxes, because of either a box-size effect or random 
fluctuations that oppose this resolution effect. For the most 
part, the fc-dependent shapes of the derivatives agree, with 
the difference being one of magnitude. A smaller version 
of the Fisher analysis done in section was completed for 
these boxes of different resolutions to check the effects of 
the derivative differences. The errors on amplitude are about 
13% higher and on flux are about 15% lower with the lower 
resolution box than with the higher resolution box. How- 
ever, since we are only looking at error bounds, which should 
hopefully be within about 25% accuracy, this error is toler- 
able, and the difference in Figure|7|is (by this criterion) not 
significant. 

2.4 Random fluctuations due to finite box size 

Another question is the magnitude of the random fluctu- 
ations due to finite box size, which can be determined by 
using different random seeds to generate the initial ran- 
dom Gaussian field. A study was done using three seeds, 
with 7 - 1 = 0.3, T1.4 = 17000 K, a = 0.24, n = 0.95, 
a — 0.0, F = 0.68, and standard resolution and noise ef- 
fects, on the values of P, B, and their F- and K-derivatives. 
Plots of the relative difference between the statistics and 
their F-derivatives computed with individual seeds and av- 
eraged over three seeds are in Figs. llOlandllllrespectivelv. 
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Figure 8. Comparison of ^-derivatives (&F = 0.02) for different 
box size and resolution, as noted on plot. The choice of filters for 
B is the same as in previous plot. 



Figure 9. Comparison of re-derivatives (8k = 0.167) for different 
box size and resolution, as noted on plot. The choice of filters for 
B is the same as in previous plot. 



A plot of the relative difference between the statistics com- 
puted with individual seeds and averaged over four seeds for 
a 40 /i _1 Mpc box is in Fig. [HI 

As shown in these figures, the statistics and their deriva- 
tives for an 80 /i _1 Mpc box do not have very large fluctu- 
ations due to finite box size, so only one random seed (the 
same each time, for consistency) was used for this analy- 
sis rather than averaging over the statistics obtained from 
several seeds. To check concretely that this is not a prob- 
lem, a Fisher analysis for just re (derivatives not shown, but 
they have a similar pattern as the flux derivatives) and F at 
one z was completed using three different seeds, and using 
the spectra and covariance matrices computed by averaging 
over lines of sight from all three seeds. The F and k error 
bounds for individual seeds were within 0.015 (relative dif- 
ference) of the bounds for the averaged spectra. Note that 
the 40 /i _1 Mpc box has significantly higher fluctuations, as 
expected, so it is possible that apparent box size effects are 
actually due to these fluctuations. 



2.5 Chunking 

The simulations were also used to study the effects of chunk- 
ing, a procedure often used for data analysis. In the data, 
the spectra can be much longer than 80 /i _1 Mpc, and are 
cut into smaller chunks for analysis. The fields 8f{\) and 
H(X) are both created from the full spectrum in real space 
before chunking. 

To identify the effects of chunking, we compared the 
values of the statistics for an 80 ft _1 Mpc box with 512 3 
particles; and the same 80 /i _1 Mpc box, computed with half- 
lines of sight and averaged (see results in Fig. I13B . All other 
simulation parameters were identical: y — 1 = 0.3, T1.4 = 



17000 K, a = 0.24, n = 0.95, a = 0.0, F = 0.7, and typical 
noise and resolution effects. 

As shown in Fig. 1131 chunking has little effect on P and 
B, with the values of the statistics computed using chunk- 
ing almost perfectly matching those for the 80 h~ 1 Mpc box 
without chunking. Thus, the chunks of spectrum as small as 
40 /i -1 Mpc used in a data analysis can accurately represent 
the results in an infinite-spectrum limit. 



2.6 Filters 

As stated in section |2~T1 we tried two filter shapes and sev- 
eral filter sizes and n umbers of filters. The squ are window, 
in Eq.|S| was used in IZaldarriaga et alJ fcOOlbl) . The Gaus- 
sian window was also tried since the Fourier transform of the 
square filter is not well-localized. The sizes of the filters were 
chosen based on the following consideration: the k values in 
these 80 /i _1 Mpc boxes range from 0.078-5.0 /iMpc" 1 . As- 
suming tt m = 0.3, at z — 3, this range corresponds to 0.0007- 
0.05 s km -1 . Below about 0.001 s km -1 , continuum fluctu- 
ations become important, and above about 0.02 s km - , 
resolution and noise effects become very important. Con- 
sequently, for square filters, the largest range of filter used 
is 0.002-0.02 s km" 1 (0.2-2.0 ftMpc" 1 ), and several smaller 
filters were chosen. For Gaussian filters, the question of fil- 
ter width is a bit more difficult because there is some small 
contribution from beyond even 2a. Thus, we hoped to find 
the balance between the competing considerations of filter 
smoothness (compactness of the Fourier transform) versus 
its ability to eliminate fc-values outside the range we want 
to consider. For the data, other possibly useful filters that 
can balance the smoothness and noise considerations are a 
triangular filter, or a Gaussian filter with a sharper low-fc 
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Figure 10. Relative difference between the values of the statistics 
computed with three different seeds and the average value (filter 
shown on plot) in an 80 h- 1 Mpc box. 



Figure 11. Relative difference between the values of F- 
derivatives (<5F=0.02) computed with three different seeds and 
the averaged spectra (filter shown on plot) in an 80 /i _1 Mpc box. 



cutoff. We show results for several sizes of square and Gaus- 
sian filters. 

First we investigated how does an increase in number 
of filters with different sizes improve the information from 
B. A Fisher analysis using varying numbers of square filters 
was done to determine the number of filters to use in the 
range 0.2-2.0 /iMpc -1 . We tried using one or two filters by 
subdividing that interval accordingly. No filters overlapped, 
as analysis showed that overlap did not increase the amount 
of information at all. The error bars computed from one 
filter versus from two filters in the same range were within 
5% of each other. Thus one filter is sufficient for analysis 
of a given range of k. However, for this analysis we used 
two non-overlapping filters for the total range, allowing us 
to analyze separately the information contribution of lower 
and higher filter ranges. 

We also investigated filter sizes and shapes as shown in 
Table Filter Bl is the lower filter of the pair referred to 
as filter B, and B2 is the higher filter. The numbers shown 
are actually for z = 3; the filter parameters are constant in s 
km -1 , and so their value in hMpc" 1 is z-dependent. While 
the magnitude of the statistics are rather filter-dependent, 
the shape of the statistics is essentially filter-independent 
and so the results were almost independent of the filter size 
and shape used, at least within the range of filters explored 
here. 

2.7 Instrumental dependencies 

Noise and resolution play an important role in determining 
the sensitivity of a given data set. We parameterize the noise 
with two numbers, / c (the fraction of noise whose amplitude 
is correlated with the signal) and S/N (the overall signal to 
noise ratio relative to the continuum, including all noise). 
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o - 




-0.1 - 
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0.1 - 

- 

-0.1 - 
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k [h-'Mpc]-' 

Figure 12. Relative difference between the values of the statis- 
tics computed with four different seeds and the average value 
(filter shown on plot) for a 40 h~ J Mpc box. The reason for the 
systematic difference between the B values for different seeds is 
that these seeds gave higher or lower than average values of P for 
several k, and since those k were within the filter, that difference 
was spread systematically to all k. 
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on B and T. Uncorrelated noise contributes to P and T, but 
not to B, whereas correlated noise contributes to all three. 
The noise's contribution to P can be calculated analytically 
whether its amplitude is correlated or uncorrelated with the 
signal. The correlated noise contribution to B depends not 
only on the noise amplitude, but also on the signal power 
spectrum P. 

In this study, we tried to match the pixelization, res- 
olution, and noise to that of the SDSS data. The pixels in 
SDSS data are roughly 1 A wide (so the smoothing a is 
approximately the size of one pixel). Assuming z — 3 and 
fi m = 0.3 this corresponds to 0.6 h~ Mpc. To match the 0.6 
/i -1 Mpc pixels, we used pixels of size 0.625 h~ 1 Mpc in the 
simulations. This small difference from the real value makes 
little effect in the value of the statistics, since even reduc- 
ing the pixel size by a factor of four changes B and P by 
at most 5%. We used Gaussian smoothing a the same size 
as our pixels. SDSS data have S/N (averaged over redshift 
bins in our range) ranging from 1 to 30, with median values 
in each redshift bin ranging from 4 to 5. We used the value 
S/N = 5.0 for all redshift bins. The noise amplitude was en- 
tirely uncorrelated with the signal (f c = 0), since we could 
not evaluate the amount of correlated noise in the data. 



Figure 13. Plot of the statistics using full lines of sight from an 
80 h — 1 Mpc simulation, and a chunked 80 h _1 Mpc simulation. 
The code "C" refers to chunking. 





Filter 1 


Filter 2 






Square Filters 






Start 


End 


Start 


End 


B 


0.2 


1.1 


1.1 


2.0 


C 


0.4 


1.2 


1.2 


2.0 


D 


0.2 


1.0 


1.0 


1.8 


E 


0.4 


1.1 


1.1 


1.8 






Gaussian 


Filters 






k 




k 




F 


0.65 


0.2 


1.5 


0.3 


G 


0.65 


0.13 


1.5 


0.2 


H 


0.65 


0.27 


1.5 


0.4 



Table 1. Table of codes for the seven filters analyzed. All figures 



are in MVlpc at z = 3. 



Noise whose amplitude is correlated with the signal is char- 
acterized by its variance a\ — cF (where F is the mean flux 
from the quasar). A typical example is photon shot noise, 
where the variance is proportional to the flux. Noise whose 
amplitude is uncorrelated with the signal is characterized 
by its variance <j\ (for example, Gaussian readout noise and 
photon shot noise from the sky) These variables are related 
as follows, defining a unique c and crj for each f c and S/N: 



fc = 



cF 



cF + a'i 



S/N = — 
°7 



\J°l + cF 



(9) 
(10) 



The distinction between correlated and uncorrelated 
noise amplitude is made because they have different effects 



3 FISHER MATRIX CALCULATION AND 
RESULTS 

The construction of the Fisher matrices for each individual 
redshift bin is straightforward. A vector containing all values 
Xi that were to be used (for example, P and B for all k) was 
created, as was its full covariance matrix (computed using all 
12288 lines of sight) and its derivatives with respect to the 
6 parameters 7, T1.4, k, n, a, and F. The derivatives with 
respect to P were created from the values with the power 
due to noise subtracted off. These covariance matrices and 
derivatives were then used to create the 6x6 Fisher matrices 
for each redshift bin by computing 

T 



(11) 



dx \ j / dx 

Ay k ) \ dyi / 

so that F is the inverse covariance matrix for y, the vector of 
all the parameters. In general, there is another term related 
to dC/d?/i that we neglect in the expectation that it is small. 

Next, a block-diagonal 54 x 54 Fisher matrix was cre- 
ated, using the 6x6 Fisher matrices for the nine redshift 
bins. Each block was multiplied by the length of spectrum 
in that redshift bin in the SDSS data; the total spectrum 
length is 7.7 x 10 7 km/s. Fisher matrices in this form were 
created for all seven sets of filters listed in Table^ for var- 
ious combinations of statistics. Further information about 
error-bounds is extracted from these large Fisher matrices 
by projecting from them down to a smaller number of pa- 
rameters. 

3.1 Projection to fewer parameters 

To reduce to the smallest number of parameters, we used 
the following procedure. The 54 x 54 Fisher matrices are 
the inverse covariance matrices C _1 (y) for the values of the 
parameters in each redshift bin. We projected down to a 
smaller set of parameters that includes: 
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z 


F 


AF/ F 


2.2 


0.85 


0.0030 


2.4 


0.81 


0.0035 


2.6 


0.77 


0.0033 


2.8 


0.73 


0.0036 


3.0 


0.68 


0.0046 


3.2 


0.63 


0.0057 


3.4 


0.57 


0.0075 


3.6 


0.52 


0.011 


3.8 


0.46 


0.015 



Table 2. Error bounds on F(z) using P and B. 





P 


B 


PB 


A70 


0.097 


0.086 


0.054 


A71 


0.18 


0.17 


0.10 


AT1.4/T1.4 


0.096 


0.093 


0.081 


A«eff 


0.045 


0.035 


0.017 


An eff 


0.014 


0.058 


0.013 


Ao cff 


0.037 


0.063 


0.029 


AF1/F1 


0.0082 


0.0057 


0.0030 



Table 3. Errors using various combinations of statistics (note 
that F\ is the mean flux for the first redshift bin, and illustrates 
the trend in errorbars for all mean fluxes). 



• 70 and 71, the constant and linear terms of a linear 
expansion of 7(2) about z = 3.0, 

• T1.4, an overall redshift-independent value, 

• K ff, an overall effective amplitude at some z and k, 

• n e £f, an overall effective slope, where the evolution of 
the pivot point (fixed in velocity coordinates) is expressed 
via n(z) = n(k p ) + a In (k p (z)/k p ), 

• a c ff , an overall effective curvature, and 

• The nine values of F, one per redshift bin. 

This projection includes 15 parameters, and can be com- 
puted using Eq. 1111 

Here we present results for the analysis done with just 
filter B, using P and B. All references to "error bounds" 
implies that we have marginalized over other parameters 
(by using (F~ 1 )u rather than 1/y/Fu). 

First, the error bounds for 70 and 71, denned by [7 — 
l](z) =70 + 71(2 — 3), are 0.054 and 0.10, respectively. We 
are not very sensitive to T1.4, with a relative error of 0.08. 

Table|5|shows the relative errors for the nine values of F. 
The errors increase with redshift mainly because the amount 
of data decreases, and at low redshifts they are extremely 
small. 

The error on k = (AvC4)/-/A is .017. The errors on n e g 
and a e ff (denned at pivot point k = 0.009 s km -1 at z = 2.6) 
are An c fr = 0.013, and Aa c ff = 0.029. The error ellipse in 
terms of n e s and a e s computed from their joint covariance 
matrix is in Fig. 1141 To answer the question of whether B 
alone provides independent information competitive wih P, 
or whether B and P together is more sensitive than either 
of the two alone, we carried out the analysis using P alone, 
B alone, and P and B together. The error bounds for all pa- 
rameters for various combinations of statistics can be found 
in Table El 

Fig. H4l shows the 68% confidence level error ellipses for 
n a ff and a e s for P, B, and both together. It is clear that 
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Figure 14. 68% CL error ellipses for n e g and o G ff computed with 
various combinations of statistics. 

for this combination of parameters P provides most of the 
information. While B can act as a useful independent con- 
firmation, adding its information by itself does not decrease 
the statistical errors significantly. 

Fig. 1151 shows the 68% CL error ellipses for K e « and 
F(z = 3.0) for P, B, and both together. In contrast with 
Fig. 1141 combining the two significantly improves upon P or 
B alone, reducing the errors on k and F by almost a factor 
of 3. The primary reason for this is the partial degeneracy 
between the two parameters in P alone, as shown in Fia. 1151 
This degeneracy is easy to understand from the derivatives 
in Figs. [5] and [!J] Changing Korf has a relatively flat effect 
on the power spectrum P, so the two parameters are sig- 
nificantly degenerate from P alone, especially when all the 
other parameters are included. In contrast, the effect on B 
is much larger for k than for F. Combining P and B thus 
allows one to break the degeneracy between the two param- 
eters and determine both with a much higher precision. 

We also investigated how the errors vary with filter, 
finding that they did not vary significantly for filters given 
in Table In particular, the error bounds on n e ft and a e g 
were virtually independent of filter, and those on K e g were 
nearly so. Among the Gaussian filters, there was a slight 
trend towards lower error bounds on K e g for the larger filters 
(0.015 for the largest versus 0.018 for the smallest) but even 
here the difference is not very large. 

3.2 Expansions of K e ff, n e g, and a e s 

In order to determine how much variation in the values 
of n e fi, a e ff, and K e g (at a particular z and k) is induced 
by deviations in n, a. f2 m . Qb, and h, we used CMBFAST 
ijSeliak fc Zaldarriagall99rl) outputs to study the derivatives 
with respect to these parameters, using spectra that were 
normalized to the same amplitude at k = 0.05 /iMpc -1 
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0.66 - 
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K 

Figure 15. 68% CL error ellipses for K e ff and F(z = 3) computed 
with various combinations of statistics. Note that the correlation 
coefficient between these parameters is 0.80 for P, 0.45 for B, and 
O.fO for B and P together. The reason that the correlation is so 
reduced when B and P are used together is that there are nonzero 
covariancc matrix elements between B and P at the same k, so 
the Fisher matrix is not that produced by simply adding those 
for B and P together. 



Model 




ilq(Z = 2.0) 


Wo 


Wl 


1 


0.67 


0.12 


-0.7 


0.0 


2 


0.85 


0.28 


-0.7 


0.0 


3 


0.49 


0.06 


-0.7 


0.0 


1 


0.67 


0.04 


-1.0 


0.0 


5 


0.67 


0.30 


-0.4 


0.0 


6 


0.67 


0.12 


-0.8 


-0.2 


7 


0.67 


0.19 


-0.8 


-0.5 


8 


0.67 


0.27 


-0.8 


-0.8 


9 


0.67 


0.09 


-1.0 


-0.5 



Table 4. Codes for quintessence models, where w q = wq+wi (a — 
!)• 



However, Q m also changes the relation between velocities 
and comoving coordinates as given by the Hubble parame- 
ter H(z). This partially cancels the transfer function effect 
of f2 m , so that the coefficients in front of f2 m are reduced 
relative to h. Baryon density has a smaller effect than other 
parameters, at least around the small value of f2f,/f2 m we 
expanded here. 

It is clear that since we can only determine precisely 
two numbers, the amplitude and the slope, and somewhat 
less precisely also the derivative of the slope, we cannot de- 
termine all of the parameters with high precision. Hence it 
is best if the Ly-a forest is combined with other cosmolog- 
ical probes, such as cosmic microwave background (CMB) 
anisotropics. Since one expects CMB to provide very strong 
constraints on h and fi m (at least in the context of spatially 
fiat models), this then allows the Ly-cv forest to determine 
the primordial slope and its derivative with high accuracy. 



at 2 = 20. The derivatives were computed at z = 2.6, 
k — 0.009 s km -1 , the pivot point for SDSS data sample. 
We assume the fiducial model Q m = 0.3, h = 0.7, Qt = 0.04 
(or x = flb/Qm, = 0.13). The pivot point for a is k = 0.05 
/iMpc -1 . The expansion for K c g is denoted Are fr to em- 
phasize that this expansion is to be used to calculate what 
relative change in power spectrum amplitude would result 
from changing the cosmological parameters from our chosen 
values (hence it gives Aft c ff = for our cosmological model). 
The resulting expansions are 



Aftcff = Ak + 1.5(n - 0.75) - 0.032a 



(12) 



Wcff 



Qcff 



3.25 + n + 3.0a+ 



(13) 



0.161n 



0.3 



(,2,1, (A) +0 .034 in 



-0.22 + a-0.111n 



03 



(14) 



17111 1 oTr) +0 ' 0341n (ol3 



The effect on the amplitude is primarily determined by 
the primordial amplitude and slope, but not the derivative 
of the slope (given that the pivot point k — 0.05 hMpc~ 
is at much lower k than the wavevectors the forest is sen- 
sitive to). Today's matter density Q m and Hubble constant 
h = _ffo/100 km/s/Mpc change the transfer function and 
so affect the amplitude, slope and derivative of the slope. 



4 POWER SPECTRUM EVOLUTION AND 
QUINTESSENCE 

In this section we try to determine the sensitivity of the 
Lyman-Q forest to deviations from the Einstein-de Sitter 
(EdS) growth factor D(a) = a. We analyzed both specific 
quintessence models and also more general deviations of the 
growth factor from EdS. Sensitivity to quintessence model 
parameters can arise from the change in the growth factor 
D (z) and from the Hubble parameter H(z). In our tests the 
latter was subdominant and most of the sensitivity was from 
the growth factor. 

First, we studied the sensitivity of the Lyman-a forest 
to quintessence parameters, f2 9 and the equation of state 
w q , for nine different quintessence models. Several models 
considered had static equation of state w q — wo, while the 
other models were dynamic, w q = wo+wi{a— 1). The models 
studied are shown in Table^] All models had Qb = 0.04 and 
h = 0.7, with ficDM determined by requiring Qb + Qcdm + 



4.1 Projection with quintessence parameters 

To find constraints on all parameters once quintessence pa- 
rameters are included, we started from the 54 x 54 Fisher 
matrices created in section |S| and projected down to the pa- 
rameters in section ti. II plus Q q and w q (or Q q , wo, and wi 
for dynamical models) . All error bounds are computed from 
the Fisher matrix for filter B using P and B. To do so, we 
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Model 


Aflq/Qq 


Ak cB 


An cS 


A „ 

Aa cB 


Model 


Aflq/flq 


A . 


A ^ 


1 


0.26 


0.041 


0.057 


0.052 


1 


0.14 


0.024 


0.039 


2 


0.05 


0.073 


0.047 


0.034 


2 


0.04 


0.059 


0.043 


3 


0.48 


0.028 


0.075 


0.053 


3 


0.26 


0.018 


0.049 


4 


0.31 


0.017 


0.097 


0.078 


4 


0.11 


0.017 


0.043 


5 


0.13 


0.025 


0.049 


0.038 


5 


0.10 


0.019 


0.041 


6 


0.16 


0.037 


0.053 


0.049 


6 


0.09 


0.022 


0.037 


7 


0.11 


0.048 


0.040 


0.039 


7 


0.08 


0.034 


0.034 


8 


0.09 


0.063 


0.035 


0.033 


8 


0.08 


0.051 


0.033 


9 


0.20 


0.030 


0.066 


0.052 


9 


0.11 


0.019 


0.044 



Table 5. Error bounds for all quintessence models with fixed 
equation of state, marginalized over other parameters. 



Table 6. Error bounds for all quintessence models with fixed 
equation of state and a, marginalized over other parameters. 



computed derivatives of k c h, n a g, and a c a with respect to 
quintessence parameters at some pivot point fixed in s km" . 
These derivatives were computed from CMBFAST outputs 
for the 9 z-bins for all 9 quintessence models studied, nor- 
malized to have the same amplitude for z = 20 at the same 
pivot point at which derivatives were taken. 

Results for all models indicated that w q and Q q are 
highly degenerate, with a correlation coefficient of -0.9. The 
well-determined direction for model 1, which is fairly repre- 
sentative, in (w q ,Qq) space is (0.60, 0.80). The strong corre- 
lation is not surprising, since the redshift range probed is too 
small to determine two parameters from the growth factor 
or H{z) evolution. Some information on Q q also comes from 
the power spectrum shape, which is why the correlation is 
not perfect. In the following we thus fix one parameter, for 
example Q q , and present errors on equation of state only. 
The motivation for this is that other tests, most notably 
CMB combined with large scale structure tests (e.g. clus- 
ter counts, galaxy clustering, weak lensing), will be able to 
determine Q q at z = very accurately. If these tests also 
provide independent constraints on wq then for the time de- 
pendent w one can use our results to constrain Wi. 

The errors for fixed w q (marginalized over all other 
variables) can be found in Table The 68% CL error el- 
lipses show how Qq and amplitude are degenerate (Tig. ll6|l . 
Analysis of the covariance matrices for these model param- 
eters (or the fact that the error bounds for K e s, n e ff, and 
a e ff have risen from their value computed without includ- 
ing quintessence) indicated a very high degree of degener- 
acy between Q q and K e g, n e g, and a e g. If a is fixed then 
the errors on Q q consequently decrease (though this effect is 
much greater for the models to which we are least sensitive). 
This indicates that at least some of our information about 
quintessence parameters comes from shape information. The 
degeneracy with K c g indicates that some information is from 
the growth factor information as well, since the magnitude 
of the growth factor (as opposed to its shape) is degenerate 
with K e ff . The errors with w and a fixed are in Table [5] 

Error bounds on w q for static models, with Q q fixed 
(as may be done by measurements other than the Lyman-a 
forest), are shown in Table|7| As shown there, now that Q q is 
fixed, n G ff and a e ff have error bounds roughly equal to those 
computed in the previous section with no quintessence. This 
fact shows that these parameters contain information about 
Q q but not w q . KeB, on the other hand, is degenerate with 
w q , indicating that w q is detected via its influence on the 
growth factor. Consequently, the model with least negative 
w q has the best error bounds. Table [S] shows error bounds 



0.05 " 




Figure 16. 68% CL error ellipse for Q q and amplitude, with w q 
(but not o c ff ) fixed, for cosmological constant (model 4) as dashed 
lines, quintessence with w = —0.4 (model 5) as solid lines, and a 
dynamic quintessence model (model 8) as dotted lines. 



for wi with both Q q and Wo fixed. Errors for n e fr and a e ff are 
not shown since they are the same as in the previous section, 
without quintessence. Not surprisingly, the error bounds are 
best for the model with the most rapidly-varying equation 
of state. 



Model 


Aw q 




An e ff 




1 


0.22 


0.10 


0.016 


0.029 


2 


0.09 


0.10 


0.014 


0.028 


3 


0.36 


0.08 


0.016 


0.029 


4 


0.36 


0.06 


0.013 


0.029 


5 


0.17 


0.25 


0.025 


0.028 



Table 7. Error bounds for static quintessence models with fixed 
Q g , marginalized over other parameters. 
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Model Ami Are eff 



6 


0.31 


0.06 


7 


0.22 


0.07 


8 


0.18 


0.08 


9 


0.41 


0.06 



Table 8. Error bounds for dynamic quintessence models with 
fixed Q q and wq, marginalized over other parameters. 



odel 


Q g (z = 2.6) 


As 


Detect? 


1 


0.12 


-0.085 


No 


2 


0.28 


-0.22 


Yes 


3 


0.06 


-0.042 


No 


1 


0.04 


-0.024 


No 


5 


0.30 


-0.23 


Yes 


6 


0.12 


-0.082 


No 


7 


0.19 


-0.14 


Yes 


8 


0.27 


-0.22 


Yes 


9 


0.09 


-0.060 


No 



Table 9. Values of As for the nine quintessence models, where 
detectability of growth factor deviation is denned by |As| > 0.11. 
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4.2 More general growth 

Besides considering specific quintessence models, we also 
considered more general power spectrum evolution, modeled 

as 

D(a) = Do{a/a ) s , (15) 

where s = 1 corresponds to an Einstein-de Sitter universe. 
Thus, we can determine if our sensitivity to quintessence is 
due to growth factor deviations. This analysis was completed 
by starting with the 54 x 54 Fisher matrix for P and B (filter 
B), and projecting down to the same set of parameters as 
in Section along with s. The projection for k and s used 
P oc e 2K D 2 (a) to relate the necessary derivatives AP/Ak 
and AP/As to the computed values AP/Aa. Using the value 
of ao determined by requiring that amplitude and s have the 
minimum degeneracy, and inputting a value of s = 1, yields 
an error As = 0.11. Note that if only information from the 
power spectrum P is used (rather than including B as done 
here), the error rises to As = 0.47. 

Values of As for arbitrary static quintessence mod- 
els were computed using an integr al expression for 
D(a) obtained from iHamiltonl (1200 ll) . Taylor expand- 
ing to several orders, which gave a form D(a) = 
a (l + X) n =i c n (Qq,w q )a~ 3nWq y Expanding this expression 
about ao gave an expression for s, and hence for As = s — 1, 
in terms of the model parameters. Requiring | As| < 0.11 de- 
fines the region in w q -£l q parameter space that is detectable 
using the Lyman-a forest (growth factor information only, 
not curvature). This curve is shown in Fig. 1171 

To check the validity of this expansion, the growth fac- 
tor was computed directly from CMBFAST outputs for the 
9 quintessence models studied above. This also allowed the 
computation of As for dynamic models, for which an an- 
alytic expression was not obtained. Values of As for the 9 
models studied earlier, and values of Q q (z = 2.6), are shown 
in Table [5] As shown, As and = 2.6) are highly corre- 
lated. 



Figure 17. Curve in the Q q , w q plane indicating regions for 
which the deviation from D(a) = a would be detectable using 
growth factor information (1 and 2cr as solid and dashed lines, 
respectively). Note that the Gaussian approximation is worse for 
the 2cr errors. 



4.3 Change in H(z) 

So far the only tests of dark energy proposed have used ei- 
ther the growth factor or the redshift luminosity distance. 
If one wishes to extract w(z), then both of these involve a 
double integral over this quantity and degeneracies arise. A 
more direct and still in principle observable way is to mea- 
sure the Hubble parameter H(z), which is related to w(z) 
through a single integral. One way to measure it is to have 
a characteristic feature fixed in comoving coordinates which 
is observed in redshift space. The relation between redshift 
space and comoving space is determined by H(z) and so 
observing the feature as a function of redshift determines 
its evolution. The problem of course is that there are no 
characteristic features imprinted, since the distribution of 
structure in the universe is stochastic in nature. 

One must therefore look for a characteristic scale in 
correlations between structures. In principle such features 
could be provided by baryonic oscillations imprinted in the 
matter power spectrum, but in practice this is a weak effect 
limited to very large scales and so cannot be made very pre- 
cise. One is thus left with the variations in the correlation 
function slope as a function of scale. The slope varies from 
n ~ 1 on large scales to n ~ —3; on Ly-a forest scales we 
find Qfcff = —0.22 in CDM models. Hence the scale at which 
the slope takes a specific value can be viewed as a standard 
ruler and can be traced with redshift. If this slope is mea- 
sured in redshift space, then one is measuring directly H(z). 
To be able to do this one has to detect the curvature in the 
slope over the dynamic range of observations. This is chal- 
lenging, since the dynamic range is narrow and the error on 
the slope will be large. We find that such a detection should 
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be possible with the current sample, but is not expected to 
improve the constraints on w(z) significantly. 



5 DISCUSSION AND CONCLUSIONS 

Our results show that the statistical precision one can 
achieve from Ly-a forest using SDSS data is truly impres- 
sive. The overall amplitude of the power spectrum, its slope 
and its curvature can all be determined with 1-3% precision. 
Our results are significan tly more optimistic than those in 
Zaldarriaga et all ll2001aD . which focused on a higher reso- 
lution but smaller data sample available from Keck. This 
is primarily a consequence of the large sample size of SDSS 
data, which increases the statistical precision on larger scales 
and moves the pivot point from 0.03 to 0.01 s km" 1 . On 
larger scales the sensitivity of the power spectrum to the 
gas temperature-density relation is reduced so one is mea- 
suring more directly the underlying matter power spectrum. 
In addition, we have shown here that higher order correla- 
tions break the degeneracy between the mean flux and the 
amplitude (this degeneracy is broken to some extent already 
by the power spectrum alone). Using the power spectrum 
alone and the bispectrum alone are two independent ways to 
confirm this determination. Using them together gives even 
lower error bounds due to the breaking of degeneracies. The 
statistical precision of such a data set is competitive with 
the one from the CMB using the MAP and Planck satel- 
lites. More importantly, combining the two probes will allow 
one to determine the primordial power spectrum to a high 
precision over 3 decades in scale (0.001 < k < 2 /iMpc" 1 ). 

Furthermore, since one is measuring the power spec- 
trum over a range of redshifts (2 < z < 4), one can also study 
the growth factor evolution. While for standard cosmological 
constant models the universe is effectively Einstein-de Sitter 
for z > 2, dynamical models with rolling scalar fields often 
produce an equation of state increasing with redshift (this 
is especially true for many of the so-called tracker models). 
In this case the dark energy or quintessence is still dynam- 
ically important for z > 2 and can affect the growth rate 
of perturbations and the Hubble parameter H(z). While 
there is no simple single parameter combination that de- 
scribes the sensitivity to dark energy, it is clear that the 
precision is correlated with Q q at z > 2. Our results show 
that if Q, q (z — 2.6) > 0.2 then the deviations in the growth 
factor are sufficiently large to be detected in Ly-a forest 
spectra using the current SDSS sample. With the full SDSS 
sample this limit can be improved further and models with 
Q. q (z — 2.6) > 0.1 should be detectable. This sensitivity can 
increase further if one can extend the available data set to 
even lower redshifts by using space based observations. 

While it is clear that the statistical power of upcoming 
data sets is impressive, the main remaining issue is whether 
the simplest model of the QSO absorption adopted here is 
valid, or whether there are other more complicated astro- 
physical processes that can spoil the picture. Since the ex- 
pected statistical precision is so high one must investigate 
processes that affect the statistics even at a very small (1%) 
level. There are several possible complications that may have 
an effect and we mention some here: metal line absorption 
along the line of sight (e.g. C-IV, Si-Ill...), an inhomoge- 



neous UV background, kinematic gas outflows generated by 
supernovae from the galaxies, temperature fluctuations, etc. 

Fortunately, while there are several possible contamina- 
tions, there are also several possible tests one can apply to 
identify and remove the contamination. Some of these pro- 
cesses can be investigated with the statistics used in this 
paper, such as the power spectrum and bispectrum. In the 
simplest model the main driver of correlations is gravity, 
which induces a very specific relation between low and higher 
order statistics. Any astrophysical contamination is likely 
to destroy these relations. Other tests such as the proba- 
bility distribution of the flux (as a function of scale ) will 
also yield useful information jMcDonald et all 12000.) . An- 
other way to investigate these effects is through the cross- 
corre lation between the fores t and galaxies at the same red- 
shift iAdelberger et a l.l2003j); for theoretical at t empts to in- 
terpret this result, see McDonald et al.1 J2002I). ICroft et al] 
<2002^ . lKollmeier et al.1 (120021) and lBruscoli et alJ (12002^ 

If the simplest model passes this and other high pre- 
cision tests it will receive an important confirmation that 
will strengthen the credibility of the results. In the opposite 
case one can still identify the contaminant and apply the 
corrections assuming the contamination is sufficiently well 
understood. This parallels the situation in the CMB, where 
secondary processes and foregrounds are searched for and, 
if identified, subtracted from the primary CMB. It is clear 
however that a validation of the interpretation set forth by 
the simplest model will require a lot of coordinated effort 
from several groups and more effort should be put into this. 
Our results suggest that this is well worth the effort and that 
the Ly-a forest could be our next cosmological gold-mine. 
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